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Abstract 

Introduction: The TWIST homolog 1 {TWIST!) is a transcription factor that induces epithelial to mesenchymal 
transition (EMT), a key process in metastasis. The purpose of this study was to investigate whether TWIST! 
expression predicts disease progression in a large breast cancer cohort with long-term clinical follow-up, and to 
reveal the biology related to TWIST! mediated disease progression. 

Methods: TWIST! mRNA expression level was analyzed by quantitative real-time reverse polymerase chain reaction 
(RT-PCR) in 1,427 primary breast cancers. In uni- and multivariate analysis using Cox regression, TWIST! mRNA 
expression level was associated with metastasis-free survival (MFS), disease-free survival (DFS) and overall survival 
(OS). Separate analyses in lymph node-negative patients (LNN, n = 778) who did not receive adjuvant systemic 
therapy, before and after stratification into estrogen receptor (ER)-positive (n = 552) and ER-negative (n = 226) 
disease, were also performed. The association of TWIST! mRNA with survival endpoints was assessed using Kaplan- 
Meier analysis. Using gene expression arrays, genes showing a significant Spearman rank correlation with TWIST! 
were used to identify overrepresented Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes 
(KEGG)-annotated biological pathways. 

Results: Increased mRNA expression level of TWIST! analyzed as a continuous variable in both uni- and 
multivariate analysis was associated with shorter MFS in all patients (hazard ratio (HR): 1.17, 95% confidence 
interval, (95% Cl):1.09 to 1.26; and HR: 1.17, 95% CI: 1.08 to 1.26; respectively), in LNN patients (HR: 1.22, 95% CI: 1.09 
to 1.36; and HR: 1.21, 95% CI: 1.07 to 1.36; respectively) and in the ER-positive subgroup of LNN patients (HR: 1.34, 
95% CI: 1.17 to 1.53; and HR: 1.32, 95% CI: 1.14 to 1.53; respectively). Similarly, high TWIST! expression was 
associated with shorter DFS and OS in all patients and in the LNN/ER-positive subgroup. In contrast, no association 
of TWIST! mRNA expression with MFS, DFS or OS was observed in ER-negative patients. Genes highly correlated 
with TWIST! were significantly enriched for cell adhesion and ECM-related signaling pathways. Furthermore, TWIST! 
mRNA was highly expressed in tumor stroma and positively related to tumor stromal content (P <0.001). 

Conclusions: TWIST! mRNA expression is an independent prognostic factor for poor prognosis in LNN/ER-positive 
breast cancer. The biological associations suggest an involvement of the tumor microenvironment in TWISTTs 
adverse role in breast cancer. 
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Introduction 

Breast cancer is one of the most frequently diagnosed 
cancers and the leading cause of cancer related deaths 
among females of the Western world [1]. Patients do not 
die from the primary tumor, but from metastases, which 
already are resistant or acquire resistance to systemic 
therapy. Metastasis is a complex, multi-step process in 
which malignant cells undergo sequential molecular 
changes helping them to disengage from primary sites, 
intravasate into blood vessels, extravasate to distant 
organs and finally colonize secondary sites. Each of these 
metastatic steps is affected by aberrant expression of a 
variety of transcription factors and among them, TWIST 
homologue 1 (TWIST1) is considered an important regu- 
lator of disease progression [2]. 

The TWIST1 protein, encoded by the TWIST1 gene, is 
a member of a large protein family called basic helix- 
loop-helix (bHLH) transcription factors [3]. Most family 
members contain a bHLH domain, which enables it to 
target specific DNA sequences and thereby allowing 
them to regulate developmental processes in many 
organs and tissues. TWIST1 plays a key role in the regu- 
lation of embryogenesis, gastrulation and mesoderm for- 
mation during early embryonic development of 
Drosophila and many other species [4,5]. An autosomal 
mutation pattern in the TWIST1 gene leads to Saethre- 
Chotzen syndrome, a genetic condition characterized by 
premature fusion of skull bones affecting symmetrical 
growth of the head and face [6]. In children, TWIST 1 
protein is involved in adequate maturation of the skull 
and spine bones and normal development of arms and 
hind legs. 

More recently, TWIST 1 protein has been implicated in 
various carcinomas, including breast cancer, where it 
plays a role in metastasis through activation of a biologi- 
cally latent developmental process called epithelial to 
mesenchymal transition (EMT) [7,8]. In the EMT pro- 
cess, malignant epithelial cells undergo cytoskeletal 
changes, including the down-regulation of epithelial mar- 
kers, such as E-cadherin and co-expressed catenins and 
up-regulation of mesenchymal markers, such as vimen- 
tin, N-cadherin and fibronectin. EMT transformed malig- 
nant cells are more motile and can be more efficient in 
invading the surrounding tissues and as a result metasta- 
size to distant organs [9] . 

In this large retrospective study of 1,427 primary breast 
cancer patients, we determined whether the TWIST1 
gene expression level is a prognostic marker. To avoid 
possible confounding effects of therapy and to study the 



natural course of the disease, we particularly focused on 
the subgroup of 778 lymph node-negative (LNN) patients 
who did not receive any adjuvant systemic therapy. Addi- 
tionally, to understand the biological context of TWIST1, 
we have identified, using available Affymetrix U133A 
gene expression data [10,11], genes and biological path- 
ways co-expressed with TWIST 1. By doing this, we iden- 
tified a clear link between the tumor microenvironment 
and TWIST1 expression in clinical breast cancer. 

Methods 

Patients 

This study was approved by the institutional medical 
ethics committee of the Erasmus MC (MEC 02.953). The 
study was performed in accordance with the Code of Con- 
duct of the Federation of Medical Scientific Societies in 
The Netherlands [12]; informed consent was not required. 
Wherever possible, the study has been reported in line 
with the Reporting Recommendations for Tumor Marker 
Prognostic Studies guidelines [13,14]. Breast cancer tissue 
specimens from 1,427 female patients with primary oper- 
able breast cancer were included in the study. Tumor sam- 
ples were originally submitted to our reference laboratory 
from 25 regional hospitals for biochemical assessment of 
steroid hormone receptor status. Guidelines for primary 
treatment were similar in all hospitals. All available frozen 
breast tumor samples of at least 100 mg from female 
patients with invasive breast cancer who entered the clinic 
during the period of 1978 to 2001, and from whom 
detailed clinical follow-up data were available were pro- 
cessed. Moreover, these patients had no distant metastasis 
within the first month after primary surgery and no prior 
cancer except basal skin carcinoma and cervical cancer 
stage I. ER, PGR and HER2 (ERBB2) levels were assessed 
by quantitative real-time reverse transcriptase PCR (qRT- 
PCR) as described before [15-17]. Lymph node involve- 
ment, tumor size and grade were extracted from the 
pathology reports as obtained from the hospitals. Primary 
surgical treatment was lumpectomy in 628 patients (44%) 
or modified mastectomy for 799 patients (56%). One thou- 
sand and nine patients (71%) received adjuvant radiother- 
apy at the thoracic wall only (n = 639) and either at the 
thoracic wall and nodal stations (n = 290) or at the nodal 
stations only (n = 80). Thirty- three percent of the patients 
had Tl tumors. The median age of the patients at surgery 
was 55 years (range, 23 to 89 years). Routine post-surgical 
follow-up and the definition of the endpoints of MFS, DFS 
and OS were described before [11,18]. During follow-up, a 
local/regional relapse for 127 patients was not counted as 
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an event in the analysis of MFS. The median follow-up 
time was 104 months (range, 4 to 262 months) with 707 
and 669 events in the analyses of MFS and OS, respec- 
tively. Study design, patient inclusion criteria and patient 
subgroups included in the analyses are given in Figure 1. 
Other relevant clinico-pathological characteristics are 
listed in Table 1. 

Tissue processing and estimation of the amount of 
epithelial tumor cells 

Tissue processing was done as described in detail pre- 
viously [15,19]. In brief, 20 to 60 cryostat sections of 
30 urn size corresponding to 30 to 100 mg weight were 



cut from frozen tissues. Three 5 um sections were cut 
before, in between and after cutting the sections for RNA 
isolation, and were stained with hematoxylin and eosin 
(H&E) to assess the amount of tumor cells relative to the 
amount of surrounding stromal cells. The amount of 
nuclei evidently of epithelial tumor cell origin relative to 
the amount of surrounding stromal cells was estimated 
with a 100-fold magnification in 10 different areas cover- 
ing the area of each of the three H&E sections. Only speci- 
mens with at least 30% of the nuclei of epithelial tumor 
cell origin and distributed uniformly over at least 70% of 
the section area were included. Like before [16], these esti- 
mates were used to dichotomize our tumor cohort at the 



Patients selected for TWIST1 mRNA expression analysis 

(n = 1,561) 



□ 



Included from the fresh frozen tissue bank; 

• Female patient with primary breast cancer 

• Diagnosed between 1978 and 2001 

• At least 100 mg frozen tissue available 

• No distant metastasis within the first month 
after primary surgery 

• No prior cancer except basal skin carcinoma 
and cervical cancer stage I 



Excluded (n = 134) 

failed the inclusion criteria due to 

• Missing clinical follow-up (n = 61) 

• Node-negative but treated (n = 7) 

• Missing data for ERBB2 (n = 1 7) 

• Missing value for tumor cells (n = 5) 

• Tumor cells less than 30% (n = 44) 



Included for TWIST1 mRNA expression analysis by quantitative reverse-transcriptase PCR 


(n 


= 1,427) 


• Primary surgical treatment (n = 1,427) 


• Adjuvant radiation treatment (n =1,009) 


• Lumpectomy (n = 628) 


• Thoracic wall only (n = 639 ) 


• Modified mastectomy (n = 799) 


• Thoracic wall and nodal station (n = 290) 




• Nodal station only (n = 80) 



Lymph node-negative patients 
(n = 778) (Affymetrix data n = 311) 

I 



Lymph node-positive patients treated with 
hormonal and/or chemo therapy (n = 649) 



ER-positive (n = 552) 
(Affymetrix data n = 193) 



I 

ER-negative (n = 226) 
(Affymetrix data n = 118) 



Study end points 

Metastasis-free survival (MFS), disease-free survival (DFS) and overall survival (OS) 



Figure 1 Study design, patient inclusion criteria and patient sub-cohorts analyzed. Associations of TWIST! mRNA expression levels with 
tumor aggressiveness were evaluated in all patients and in LNN, LNN/ER-positive and LNN/ER-negative patients. The LNN patients in this study 
did not receive adjuvant systemic therapy. 
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Table 1 Association of TWIST1 mRNA expression with standard clinical and pathological characteristics of patients and 
tumors 



Characteristics 



No. patients 



TWIST1 mRNA expression (Median value*) 



P-value 



Age (years) 

<40 

41 to 55 
56 to 70 
>70 

Menopausal status 

Premenopausa 

Postmenopausal 

Tumor size 

pT1, <2 cm 

pT2, >2 to <5 

pT3, >5 + pT4 

Lymph nodes involved 

0 

1 to 3 
>3 

Grade 

Poor 

Good/Moderate 
Unknown 
Tumor histology 

IDC 
ILC 

Medullary 

Mucinous 

ER status (mRNA) 

Negative 

Positive 

PGR status (mRNA) 

Negative 

Positive 

ERBB2 status (mRNA) 

Negative 

Positive 



530 
-171 

238 

603 
824 

463 
807 
157 

778 
288 
361 



230 
397 

957 
112 
31 
39 

360 
1,067 

572 
855 

1,187 
240 



0.025 
0.028 
0.030 
0.020 

0.027 
0.027 

0.031 
0.025 
0.028 

0.026 
0.025 
0.030 

0.026 
0.027 
0.029 

0.027 
0.044 
0.016 
0.008 

0.021 
0.029 

0.023 
0.029 

0.026 
0.032 



0.04 + 
0.21* 

0.009 1 

0.007 1 

0.47 1 

<0.001*' 5 
<0.001 11 

0.06 + 

<0.001 + 

<0.001 + 



Abbreviations: IDC, invasive ductal carcinoma; ILC, invasive lobular carcinoma. 

* Expression values after normalization to our set of three reference genes [B2M, HMBS and HPRT1). 

t P Spearman rank correlation test. 

$ P for Mann-Whitney test. 

1 P for Kruskal-Wallis test. 

§ TWIST! expression was compared between IDC and ILC tumors. 



median level of 70% tumor cell nuclei in stromal-rich (SR) 
(primary tumors containing >30% stromal components) 
and stromal-poor (SP) (primary tumors containing at least 
70% tumor cells). 

RNA isolation, cDNA synthesis, and quantification of 
specific mRNA species 

RNA isolation, cDNA synthesis, quantification of specific 
mRNA species and quality control checks were done as 
described previously by Sieuwerts et al. [15]. qRT-PCR 



was performed in 25-pd reaction volume in a Mx3000P 
Real-time PCR System (Agilent, Amsterdam, The Neth- 
erlands) using a commercially available gene expression 
assay for TWIST1 (Hs00361186_ml) and for VIM (Hs 
Hs00185584_ml) from Applied Biosystems/Life Tech- 
nologies (Nieuwerkerk aan den IJssel, The Netherlands). 
For EPCAM mRNA measurement, a combination of the 
following primers and probe was used (F: 3'-AGT TTG 
CGG ACT GCA CTT CA, R: 3'- AAT ACT CGT GAT 
AAA TTT TGG ATC CA, FAM-labeled MGB probe: 
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AAG GAG ATC ACA ACG CGT). Primer sequences for 
estrogen receptor-a (ESR1), progesterone receptor 
(PGR), ERBB2 and the reference genes {HMBS, HPRT1 
and B2M), as well as how PCR reactions and validations 
were carried out to ensure PCR specificity, were done as 
described before [15,17]. Levels of mRNA expression 
were quantified relative to the set of reference genes as 
described before [15]. 

Laser capture microdissection 

Laser capture microdissection (LCM) was performed 
according to a previously published method [20] . Briefly, 8 
urn sections of fresh frozen tumor tissue were cryosec- 
tioned and pasted on UV sterilized polyethylene naphtha- 
late (PEN) covered glass slides (P.A.L.M. Microlaser 
Technologies, Bernried, Germany). The slides were air 
dried for 10 sec at room temperature, fixed in ice-cold 
70% ethanol and washed shortly in Milli-Q water. The 
slides were then stained for 15 sec in Haematoxylin (Klini- 
path 4085.9005, Duiven, The Netherlands) and washed 
again in Milli-Q water. Subsequently, the slides were dehy- 
drated twice in RNase-free 50, 70, 95 and 100% ethanol for 
30 sec each, and air-dried. Next, LCM was performed 
directly on the stained sections. From consecutive cryosec- 
tions of a tumor tissue, both the stromal content and 
tumor epithelial cells were collected separately in triplicate 
in P.A.L.M. tube caps containing 25 ul of tissue lysis (RLT 
+) buffer (Qiagen 1053393, Venlo, The Netherlands) using 
a P.A.L.M. LCM device, type P-MB (P.A.L.M. Microlaser 
Technologies AG). Both stromal and epithelial cell frac- 
tions were then spun down into 0.5-ml Eppendorf Protein 
LoBind tubes (Eppendorf, Hamburg, Germany) containing 
225 ul RLT+ buffer and used for RNA isolation. 

Immunohistochemistry (IHC) 

Sections of 4 um were cut from formalin-fixed paraffin- 
embedded tissues, pasted on glass slides and dried over- 
night at 37°C. The slides were dewaxed in xylene solution 
and hydrated in a series of different percentages of 
alcohol (100%, 95%, 75% and 50%). The antigen was 
retrieved for 40 minutes in a hot water bath (at 95°C ) in 
Tris-EDTA buffer, pH 9.0 (DAKO; S2367, Glostrup, 
Denmark) and cooled for 20 minutes at room tempera- 
ture. After retrieval, slides were immersed in 3% hydro- 
gen peroxide for 10 minutes to block endogenous 
peroxidase activity, and in a DAKO protein-free serum 
blocking solution for 30 minutes to block non-specific 
binding sites. The slides were then incubated overnight 
at 4°C with a mouse monoclonal primary antibody raised 
against a human recombinant fragment of TWIST1 
(AbCam TWIST2Cla, ab50887, Cambridge, UK) and 
diluted 1:100 in an antibody diluent (DAKO). Negative 
controls were made by replacing the primary antibody 
with mouse immunoglobulin at an appropriate dilution. 



Next, the slides were treated with a secondary antibody 
(envision mouse kit, DAKO) and visualized using DAB 
(envision kit, DAKO). The slides were counterstained 
with hematoxylin and dehydrated through graded alcohol 
and xylene. Evaluation of the immunostaining and histos- 
coring were performed according to a previously pub- 
lished method [21]. Briefly, a histoscore of 0 to 300 was 
given to each tumor by multiplying the proportion of 
cells with nuclear staining in a tissue by the intensity of 
the staining as follows; histoscore = (0 x percentage not 
stained) + (1 x percentage weakly stained) + (2 x percen- 
tage moderately stained) + (3 x percentage strongly 
stained). The histoscores calculated in this way were cor- 
related with TWIST1 mRNA expression. The intensity of 
TW1ST1 stromal expression in tumor tissues was scored 
as no, weak, moderate or strong staining. 

Identification of TWIST1 co-expressed genes on 
Affymetrix U133A gene-chips and identification of Gene 
Ontology (GO) terms and biological pathways 

In this analysis we used TWIST1 mRNA expression, 
which was previously measured by Affymetrix U133A 
gene-chips (Affymetrix, Santa Clara, CA, USA) (probe set 
213943_at) [10,11,22]. The mRNA expression levels of 
TWIST1 measured on the Affymetrix- array and by qRT- 
PCR were highly correlated in an overlap of 193 LNN7 
ER-positive and 118 LNN/ER-negative tumor specimens 
using the Spearman-Rank correlation test (Spearman 
rank correlation, Rs = 0.7; P <0.001 for both). Genes 
showing a significant positive or negative correlation 
with TWIST1 were identified by using these Affymetrix - 
array data after all genes were corrected for multiple test- 
ing by applying a false discovery rate (FDR) of 5% [23]. 
Biological pathways and GO terms from genes signifi- 
cantly co-expressed with TWIST1 were identified using 
the ArrayTrack™software (NCTR/FDA, Jefferson, AR 
72079, USA) [24]. The overrepresented pathways and 
GO terms identified by ArrayTrack™ were based on 
Fisher's exact statistics. Details about how ArrayTrack™ 
software functions can be retrieved from the website 
[25]. The gene expression data used have previously been 
deposited in the National Center for Biotechnology Infor- 
mation/Gene Expression Omnibus database entries 
GSE2034 and GSE5327. 

Statistics 

Computations were done with the use of the STATA 
statistical package, release 11.2 (STATA Corp., College 
Station, TX, USA). Differences in levels were assessed with 
the Mann- Whitney U test or Kruskal-Wallis test. In these 
tests, patient and tumor characteristics were used as 
grouping variables. The strengths of the associations 
between continuous variables were tested with the Spear- 
man rank correlation test. TWIST1 mRNA expression 
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levels measured by qRT-PCR were log transformed to 
reduce the skewness (Skewness-Kurtosis normality test, 
P <0.05) and to attain symmetric distribution (Additional 
file 1: Figure SI). The prognostic value of the clinical and 
biological variables, with MFS, OS and DFS as the end 
points in the univariate and multivariate analyses, were 
investigated with the use of the Cox proportional hazard 
model. The hazard ratio (HR) and its 95% confidence 
interval (95% CI) were derived from these models by ana- 
lyzing TWIST1 mRNA expression levels as a continuous 
variable or after dividing their levels into quartiles. The 
proportionality assumption was investigated with a test 
based on the Schoenfeld residuals. Kaplan-Meier survival 
analysis was performed and a log-rank test was used to 
assess a trend of survivor's function across four quartiles 
of TWIST1 mRNA expression levels. All P-values are two- 
sided, and P <0.05 was considered statistically significant. 

Results 

Association of TWIST1 mRNA expression with standard 
clinical and pathological characteristics of patients and 
tumors 

Association of TWIST1 mRNA expression levels with 
clinical and pathological characteristics of patients and 
tumors are shown in Table 1. Tumor TWIST 1 mRNA 
expression was higher in patients with >3 positive lymph 
nodes, and lower in pT2 tumors and in older patients. 
Furthermore, TWIST1 mRNA expression was higher in 
invasive lobular carcinoma (ILC) compared with invasive 
ductal carcinoma (IDC). Mucinous and medullary- type of 
tumors showed the lowest TWIST1 mRNA expression 
levels. In addition, TWIST1 mRNA expression level was 
higher in PG7?-positive and £7?5S2-overexpressing 
tumors, but was not associated with tumor grade and 
menopausal status. 

Association of TWIST1 mRNA expression with prognosis in 
primary breast cancer patients 

To explore the prognostic significance of TWIST1 
mRNA expression in primary breast cancer patients, 
we first performed Cox uni- and multivariate analyses 
(traditional factors in the multivariate analysis were; 
age, menopausal status, number of positive lymph 
nodes, tumor size, tumor grade, ER status, PGR status 
and ERBB2 status) for MFS, DFS and OS as a function 
of continuous TWIST1 mRNA expression levels. In all 
1,427 patients, increasing TWIST1 mRNA expression 
levels was associated with shorter MFS in both uni- 
and multivariate analysis (HR: 1.17, 95% CI: 1.09 to 
1.26; P <0.001 and HR: 1.17, 95% CI: 1.08 to 1.26; 
P <0.001; respectively) (Table 2). To visualize the prog- 
nostic value of TWIST1 mRNA expression in Kaplan- 
Meier survival curves, we divided TWIST1 expression 
levels into four quartiles Ql (low) to Q4 (high) (Figure 2). 



To exclude a possible confounding treatment effect, we 
also evaluated the prognostic value of TWIST1 mRNA 
expression in LNN-patients who did not receive any adju- 
vant systemic therapy. We also stratified these LNN 
patients into ER-positive and ER-negative subgroups as 
these subgroups are considered biologically distinct. In the 
analysis of all 778 LNN patients, an association of increas- 
ing TWIST1 mRNA expression levels with shorter MFS 
was observed in both uni- and multivariate analysis (uni- 
variate HR: 1.22, 95% CI: 1.09 to 1.36; P = 0.001 and HR: 
1.21, 95% CI: 1.07 to 1.36; P = 0.001; respectively) (Addi- 
tional file 2: Table SI). Moreover, within the LNN sub- 
group of patients, the association of increasing TWIST1 
mRNA levels with shorter MFS was confined to the ER- 
positive subgroup of patients (univariate (HR: 1.34, 95% 
CI: 1.17 to 1.53; P <0.001) and multivariate (HR: 1.32, 95% 
CI: 1.14 to 1.53; P <0.001)) (Table 3). There was no asso- 
ciation of TWIST1 mRNA levels with MFS in the ER- 
negative subgroup of patients (univariate HR = 1.05, 95% 
CI: 0.86 to 1.29; P = 0.64) (Additional file 2: Table S2). 
Kaplan-Meier survival curves for all patients and sub- 
groups of patients with LNN and LNN/ER-positive and 
LNN/ER-negative disease, after dividing these cohorts in 
quartiles (Ql to Q4), are shown in Figure 2A-D. Cox 
regression analysis for MFS showed that in the LNN/ER- 
positive group the quartile with the highest TWIST1 
mRNA levels (Q4) had a two-fold increased hazard ratio 
compared with the quartile with the lowest TWIST1 
mRNA levels (Ql) (Table 3). 

We also related TWIST1 mRNA expression levels with 
OS in all patients, in the LNN cases as a whole and after 
stratifying by ER status. Similarly as observed for MFS, 
increasing TWIST1 mRNA expression levels was asso- 
ciated with shorter OS in all patients (univariate HR: 

1.10, 95% CI: 1.08 to 1.26, P = 0.013; multivariate HR: 

1.11, 95% CI: 1.03 to 1.20, P = 0.006), in the LNN (uni- 
variate HR: 1.13, 95% CI: 1.01 to 1.27, P = 0.03; multivari- 
ate HR: 1.16, 95% CI: 1.03 to 1.30, P = 0.01) and in the 
LNN/ER-positive group of patients (univariate HR: 1.26, 
95% CI: 1.09 to 1.45, P = 0.001; multivariate HR: 1.27, 
95% CI: 1.11 to 1.46, P = 0.001). These associations were 
not significant in the LNN/ER-negative group of patients 
(univariate HR: 0.96, 95% CI: 0.78 to 1.18, P = 0.69; mul- 
tivariate HR: 0.98, 95% CI: 0.79 to 1.21, P = 0.85). The 
results are included in Additional file 2: Tables S3-S6. 
Kaplan-Meier survival curves showing OS for all patients, 
and also for the subgroups of patients after dividing them 
into quartiles based on TWIST1 mRNA levels are shown 
in Additional file 1: Figure S2A-D. 

We finally analyzed whether TWIST1 mRNA expres- 
sion levels were associated with DFS in all patients and 
in LNN/ER-positive patients only, and obtained similar 
results as were obtained for MFS and OS (Additional 
file 2: Tables S7-8). 
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Table 2 Model for uni-and multivariate analysis for metastasis-free survival in all patients (n = 1,427) 

Factors No. patients Univariate Multivariate 

HR 95% CI P-value HR 95% CI P-value 



Age (years) 

<40 188 1 1 



41 to 55 


530 


0.92 


0.73 to 1.16 


0.47 


0.88 


0.69 to 1.12 


0.30 


56 to 70 


471 


1.02 


0.80 to 1 .29 


0.88 


0.85 


0.59 to 1.21 


0.36 


>70 


238 


0.80 


0.66 to 1 .06 


0.12 


0.69 


0.47 to 1 .03 


0.07 


Menopausal status 
















Premenopausal 


603 


1 






1 






Postmenopausal 


824 


1.07 


0.92 to 1 .25 


0.35 


1.23 


0.94 to 1 .62 


0.14 


Tumor size 
















pTl, <2 cm 


463 


1 






1 






pT2, >2 to <5 


807 


1.65 


1 .39 to 1 .97 


<0.001 


1.38 


1.15 to 1.65 


<0.001 


pT3, >5 + pT4 


157 


2.80 


2.20 to 3.58 


<0.001 


1.80 


1.39 to 2.33 


<0.001 


Lymph nodes involved 
















0 


778 


1 






1 






1 to 3 


288 


1.79 


1.48 to 2.17 


<0.001 


1.71 


1.41 to 2.08 


<0.001 


>3 


361 


2.92 


2.46 to 3.46 


<0.001 


2.63 


2.19 to 3.16 


<0.001 


Grade 
















Poor 


800 


1 






1 






Good/Moderate 


230 


0.51 


0.40 to 0.65 


<0.001 


0.64 


0.50 to 0.82 


<0.001 


Unknown 


397 


0.89 


0.75 to 1 .05 


0.17 


0.99 


0.83 to 1.17 


0.87 


ER status (mRNA) 
















Negative 


360 


1 






1 






Positive 


1,067 


0.82 


0.70 to 0.97 


0.02 


0.83 


0.67 to 1 .04 


<0.001 


PGR status (mRNA) 
















Negative 


572 


1 






1 






Positive 


855 


0.75 


0.64 to 0.87 


<0.001 


0.87 


0.71 to 1.06 


0.17 


ERBB2 status (mRNA) 
















Negative 


1,187 


1 






1 






Positive 


240 


1.30 


1 .08 to 1 .57 


0.006 


1.18 


0.97 to 1 .43 


0.09 














Addition to the base model 


TWIST1 mRNA level + (Continuous) 


1,427 


1.17 


1 .09 to 1 .26 


<0.001 


1.17 


1.08 to 1.26 


<0.001 


TWIST! mRNA level + 
















TWIST1 quartile 1 


360 1 








1 






TWIST1 quartile 2 


355 


1.16 


0.94 to 1 .45 


0.17 


1.09 


0.88 to 1.37 


0.43 


TWIST1 quartile 3 


357 


1.22 


0.98 to 1.51 


0.08 


1.25 


1 .00 to 1 .56 


0.05 


TWIST1 quartile 4 


355 


1.49 


1 .20 to 1 .83 


<0.001 


1.43 


1.15 to 1.77 


0.001 


\TWIST1 mRNA expression level (log-transformed after normalization to the expression of three reference genes) 


was separately added to the base multivariat 


model that included the factors age, menopa 


usal status, tumor size, 


grade, 


lymph-node status, ER, PGR, and ERBB2 mRNA expression levels, tested both as a 



continuous variable and in four quartiles. 



Investigation of TWIST! related biological pathways in 
LNN breast cancer 

To explore the underlying biology of TWIST 1 -associated 
disease progression in breast cancer, we determined 
TWIST1 co-expressed genes previously measured on Affy- 
metrix U133A gene-chips [10,11]. Since we found TWIST1 
to be associated with poor prognosis in LNN/ER-positive 
patients only, we first decided to include only the LNN/ 
ER-positive samples to examine the TWIST1 co-expressed 
genes. After using a Spearman correlation test followed by 
applying a multiple testing correction at a FDR of 5%, we 



found 1,847 genes to be positively correlated and 1,445 
genes to be negatively correlated with TWIST1 mRNA 
expression (Additional file 3: Table S9A, B). To reveal 
which biological pathways might be associated with 
TWIST1 mRNA expression, we performed pathway ana- 
lyses using the genes co-expressed with TWIST1 and the 
KEGG pathway database as input for the analysis in the 
ArrayTrack™ software package. The three most signifi- 
cantly overrepresented pathways containing these TWIST1 
co-expressed genes were: Focal adhesion pathway (Fisher's 
exact P = 1.0 x 10" ), ECM-receptor interaction pathway 
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Figure 2 Kaplan-Meier survival curves presenting the association of TWIST1 mRNA expression with metastasis-free survival in: A: 
all 1,427 patients; B: LNN patients only, C: LNN and ER-positive patients and D: LNN and ER-negative patients The LNN patients in this 
study did not receive adjuvant systemic therapy. The patients are divided into four quartiles (Q1 (low) to Q4 (high)) based on TWIST1 mRNA 
expression levels after normalizing these to the expression of our set of three reference genes (B2M, HMBS and HPRT1). Patients at risk at various 
time points are indicated. 



(Fisher's exact P = 9.9 x 1CT 7 ), and TGF-beta signaling 
pathway (Fisher's exact P = 2.6 x 10~ 4 ) (Additional file 3: 
Table S10). In addition, GO term enrichments among 
TWIST1 co-expressed genes identified "cell adhesion", 
"extracellular structure organization" and "anatomical 
structure formation involved in morphogenesis" as the top 
three significant GO terms (level >3; gene hits >5; P-values 
<0.001) in the category of "biological processes", and the 
GO terms, "extracellular matrix" proteinaceous extracellu- 
lar matrix and "extracellular matrix part" as the top three 
significant GO terms in the category "protein cellular com- 
ponents" (Additional file 3: Table Sll). 



To evaluate a possible TWIST1 mediated biological dif- 
ference between LNN/ER-positive and LNN/ER-negative 
patients, we also performed pathway analyses using 
TWIST1 co-expressed genes (n = 1,141 positively and n = 
498 negatively correlated genes with TWIST1, respectively, 
Additional file 3: Table S12A, B) measured on Affymetrix 
arrays from LNN/ER-negative patients (n = 118). As 
observed in LNN/ER-positive patients, these analyses iden- 
tified Focal adhesion and ECM-receptor interaction 
pathways as the two most significant pathways. But, in 
contrary to what observed in LNN/ER-positive patients, 
the TGF-beta signaling pathway was not found to be the 
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Table 3 Model of uni- and multivariate analysis for metastasis-free survival in lymph-node negative and ER-positive 
patients (n = 552) 



Factors 


No. patients 




Univariate 






Multivariate 




HR 


95% CI 


P-value 


HR 


95% CI 


P-value 


Age (years) 
















<40 


66 


1 






1 






41 to 55 


187 


0.97 


0.64 to 1 .49 


0.90 


1.05 


0.68 to 1 .64 


0.81 


56 to 70 


179 


0.79 


0.51 to 1.23 


0.29 


0.81 


0.40 to 1 .66 


0.57 


>70 


120 


0.59 


0.35 to 0.99 


0.05 


0.59 


0.27 to 1.28 


0.18 


Menopausal status 
















Premenopausal 


215 


1 






1 






Postmenopausal 


337 


0.76 


0.57 to 1.00 


0.05 


0.99 


0.56 to 1 .74 


0.98 


Tumor size 
















pTl, <2 cm 


251 


1 






1 






pT2, >2 to <5 


276 


1.12 


0.84 to 1 .49 


0.43 


1.09 


0.82 to 1 .46 


0.34 


pT3, >5 + pT4 


25 


1.62 


0.84 to 3.11 


0.15 


1.92 


0.99 to 3.75 


0.06 


Grade 
















Poor 


257 


1 






1 






Good/Moderate 


124 


0.53 


0.35 to 0.79 


0.002 


0.56 


0.37 to 0.85 


0.007 


Unknown 


171 


1.06 


0.78 to 1 .44 


0.70 


1.20 


0.87 to 1 .64 


0.26 


PGR status (mRNA) 
















Negative 


116 


1 




I 








Positive 


436 


0.67 


0.49 to 0.92 


0.013 


0.71 


0.51 to 0.99 


0.04 


ERBB2 status (mRNA) 
















Negative 


475 


1 






1 






Positive 


77 


1.51 


1.06 to 2.15 


0.02 


1.39 


0.97 to 2.00 


0.08 












Addition to the base model 




TWIST1 mRNA level + (Continuous) 


552 


1.34 


1.17 to 1.53 


<0.001 


1.32 


1.14 to 1.53 


<0.001 


TWIST1 mRNA level + 
















TWIST1 quartile 1 


135 


1 






I 






TWIST1 quartile 2 


125 


1.02 


0.65 to 1.61 


0.92 


0.90 


0.57 to 1.43 


0.67 


TWIST1 quartile 3 


149 


1.28 


0.85 to 1 .94 


0.24 


1.09 


0.71 to 1.68 


0.70 


TWIST1 quartile 4 


143 


2.06 


1 .39 to 3.04 


<0.001 


1.87 


1 .25 to 2.80 


0.002 



t TWIST! mRNA expression level (log-transformed after normalization to the expression of three reference genes) 
model that included the factors age, menopausal status, tumor size, grade. Eft, PGR, and ER8B2 mRNA expression 
in four quartiles. 



was separately added to base multivariate 
levels, tested both as a continuous variable and 



third most significant pathway in LNN/ER-negative 
patients (Additional file 3: Table S13). We also performed 
pathway analysis using TWIST1 co-expressed genes in all 
LNN patients irrespective of ER status. Analyses revealed 
only Focal adhesion and ECM-receptor interaction path- 
ways as the two most significant pathways, but the TGF- 
beta signaling pathway was not found to be the third top 
significant pathway in all patients (Additional file 3: Table 
S14). Altogether, these data suggest that the co-expression 
of the TGF-beta signaling pathway with TWIST1 gene is 
more prominent in LNN/ER-positive cancer. 

Association of TWIST1 mRNA expression levels with 
stromal content 

The co-expression analysis suggested an association 
between stromal content of the tumor tissue and TWIST1 
mRNA expression levels. Therefore, we studied the 



association of TWIST1 mRNA expression with tumor 
stromal content in more detail. For this, we performed 
LCM to separately isolate the stromal (S) and epithelial 
tumor cell (T) areas from 10 breast tumor tissues (n = 9 
ESR1 high, n = 1 ESR1 low) with sufficiently micro-dissec- 
tible fresh frozen tissue left. After LCM we measured 
TWIST1 mRNA by qRT-PCR in RNA isolated from both 
the stromal and the epithelial tumor cell fractions of these 
tissues, and normalized the expression levels to our set of 
reference genes. To ensure that our reference gene-set did 
not cause a bias in these analyses, and our LCM material 
was indeed enriched for stromal and epithelial cells, we 
also measured VIM and EpCAM (Epithelial Cell Adhesion 
Molecule) in these fractions (Figure 3A). In these analyses, 
EpCAM mRNA expression was high in the epithelial 
tumor cell fractions (3.4-fold, two-sided paired t-test P = 
0.006) and VIM and TWIST1 mRNA expression were 
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Figure 3 Association of TWIST1 mRNA expression with stromal content of the tumor tissue A: TWIST1 mRNA expression levels in laser 
capture microdissected breast tumor tissues. The stromal and epithelial tumor cell areas were separately analyzed after microdissecting 10 breast 
tumor tissues (n = 9 with high ESR /expression, n = 1, (T6), with low E5R1 expression). After LCM, RNA isolation and cDNA synthesis were done as 
described in the Patients and Methods sections. TWIST! mRNA levels were measured by qRT-PCR in RNA isolated from both the stromal (S) and 
the epithelial tumor cell (T) compartments of these tissues, and levels were normalized to our set of reference genes by the delta (A) Ct method. 
To ensure our set of reference genes did not cause a bias in these analyses, and our LCM material was indeed enriched for stromal and 
epithelial cells, we also measured VIM and EpCAM in these fractions. A two-sided paired t-test was performed to evaluate the differences. B: 
TWISTI mRNA expression in SR and SP ER-positive tumors and C: TWIST1 mRNA expression in SR- and SP ER-negative tumors. For figure sections 
B and C, the mRNA expression of TWISTI was log-transformed and the expression relative to reference genes is presented on the y-axis of the 
box-and-whisker plots. The box-plot shows the five statistics (box bottom-line is the 25 th percentile, solid line in the middle of the box presents 
the median, upper line of the box is the 75 th percentile and the whiskers extend to 1.5 times the inter-quartile range, observations beyond these 
values are plotted, separately). Mann-Whitney test was performed to evaluate the difference between the SR and the SP groups in both ER- 
positive and ER-negative patients (B, C). 



high in the stromal fractions (23.4-fold, P = 0.0001 and 
14.6-fold, P = 0.0004, respectively). As the prognostic 
value for TWISTI mRNA expression is confined to ER- 
positive breast cancer, we assessed whether ER expression 
might be of relevance for TWISTI mRNA expression in 
tumor tissues with a variable amount of stromal content. 
For this, we compared TWISTI mRNA expression levels 
between stromal rich (SR) and stromal poor (SP) tumors 
in the ER-positive and ER-negative subgroups. Notably, 
only in ER-positive tumors, TWISTI mRNA expression 
was significantly higher in SR compared with SP tumors 



(Figure 3B, P <0.001); in ER-negative tumors the difference 
was not significant (Figure 3C, P = 0.41). 

This altogether suggests that TWISTI mRNA is predo- 
minantly expressed in the stromal compartment of breast 
tumor tissues. Besides addressing the issue of where 
TWISTI mRNA is predominantly expressed in breast 
cancer tissues as described above, we also performed 
IHC staining to ascertain the localization of TWISTI 
protein. For this, we stained 20 tumor tissues with differ- 
ent TWISTI mRNA expression (n = 10 with high 
TWISTI mRNA expression, n = 10 with low TWISTI 
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mRNA expression). Apparently, TWIST1 protein is pre- 
dominantly localized in the nuclei of both epithelial 
tumor cells and in almost all non-epithelial cells present 
in the stromal compartment of tissues, such as fibroblast 
cells, endothelial cells and inflammatory cells (Figure 4A, 
B). In these 20 tumors, TWIST1 mRNA expression levels 
were significantly correlated with the nuclear protein 
expression in tumor tissue (Spearman rank correlation, 
Rs = 0.61, P <0.004) (Figure 4C). Additionally, tumor tis- 
sues showing weak/moderate TWIST 1 protein staining 
were mostly ER-negative (Figure 4C). 



Our data suggest that stroma of ER-positive (and not 
ER-negative tumors) is enriched for TWIST1. To address 
this in more detail, we stained 20 primary breast tumor 
tissues (n = 7 ER-negative and n = 13 ER-positive) for 
TWIST 1 protein. We observed that 2 out of 7 (29%) of 
the ER-negative tumor tissues and 10 out of 13 (77%) of 
the ER-positive tumor tissues showed moderate to strong 
TWIST1 protein staining in the stroma. The staining of 
TWIST1 in the other tumors (71% of the ER-negatives 
and 23% of the ER-positives) was weak. These data based 
on a limited number of samples further support our 
qRT-PCR based findings at the protein level that the 
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Figure 4 Representative tissues with IHC staining of TWIST1 protein. A: Strong nuclear staining of TWIST1 protein in an ER-positive tumor 
tissue with high TWIST1 mRNA expression. B: Weak nuclear staining of TWIST! protein in an ER-negative tumor tissue with low TWIST1 mRNA 
expression. C: Correlation of TWIST1 mRNA expression and protein expression in breast tumor tissues (n = 20; n = 10 high TWIST! mRNA 
expression, n = 10 low TWIST! mRNA expression). Red and green boxes represent high and low TWIST! mRNA expressing tumor tissues, 
respectively, whereas empty boxes and filled boxes represent ER-negative and ER-positive tumor tissues, respectively. Spearman rank correlation 
test was used to evaluate the correlation between TWIST! mRNA and protein expression. D: Association of stromal staining of TWIST1 protein 
with ER-positive tumor tissues, (v = negative nuclei of inflammatory cells; ▼ = strong positive nuclei of inflammatory cells; X = strong positive 
nuclei of fibroblast; o = strong positive nuclei of tumor cells; J = strong positive nuclei of endothelial cells). 
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stroma of ER positive tumors is enriched for TWIST1 
expression (Figure 4D). 

Discussion 

The bHLH transcription factor TWIST 1, which is essen- 
tial in developmental processes, such as gastrulation, has 
been shown to be oncogenic in various cancers [26-33]. 
Functional studies pin-pointed a pivotal role for this pro- 
tein in EMT, a fundamental biological process considered 
to be crucial for metastatic spread in various carcinomas, 
including breast cancer. This prompted us to analyze 
mRNA expression of this gene in primary breast tumors 
from a large retrospective cohort of patients with com- 
plete clinical follow-up and address two basic questions. 
First, can TWIST1 mRNA expression in breast cancer tis- 
sue predict disease progression? Second, as we had Affy- 
metrix U133A gene-chips data available for a subset of 
samples [10,11,22], can we reveal which biological path- 
ways are co-expressed with TWIST1 in breast cancer? 

Our current study shows that high TWIST1 mRNA 
expression is an independent marker of poor outcome in 
breast cancer patients. As TWIST1 mRNA expression was 
also a predictor of poor outcome in patients with LNN 
disease who did not receive any adjuvant systemic treat- 
ment, we may conclude that TWIST1 mRNA expression is 
a pure prognostic marker and is associated with the nat- 
ural course of disease progression. This suggests that 
tumors with high expression of TWIST1 are more aggres- 
sive, a finding that fits with experimental data where over- 
expression of TWIST1 resulted in increased extravasation 
during the dissemination of tumor cells [7]. Also in line 
with this is the increased lymph node involvement of 
breast tumors exhibiting higher TWIST1 expression as 
observed by us and others [34] . The first seminal paper on 
TWIST1 in breast cancer also showed higher TWIST1 
mRNA expression in infiltrative lobular carcinoma [7], 
which we also confirmed, though the implications of this 
remain unclear. 

What we further unexpectedly observed was that the 
association of TWIST1 expression with poor prognosis 
was confined to ER-positive cases and not seen in ER- 
negative breast cancer. Similar observations were recently 
made by Van Nes and colleagues [21]. This group, by 
studying TWIST1 expression in a smaller-sized cohort 
using IHC on tissue microarrays, also found significantly 
more cumulative relapses in the ER-positive subgroup of 
tumors. This observation was not significant in their entire 
cohort, which also included ER-negative cases. Using the 
same antibody, we observed, in a subset of formalin-fixed 
specimen available from our cohort, clear nuclear 
TWIST 1 expression. This protein expression was not only 
confined to the nuclei of invasive tumor cells but also pre- 
sent in nuclei of other cells, such as fat cells, fibroblasts, 
endothelial cells and inflammatory cells. Furthermore, 



although based on small numbers, TWIST 1 protein stain- 
ing was high in those specimens in which we had mea- 
sured high TWIST1 mRNA expression in a matched 
fresh-frozen sample of the same tumor, while in speci- 
mens with low TWIST1 mRNA expression the protein 
staining was overall weaker or absent. 

To study the potential biological mechanisms in disease 
progression connected to TWIST1 expression, we identi- 
fied genes co-expressed with TWIST1 in LNN/ER-posi- 
tive tumors from which genome wide gene expression 
data were available. Three related pathways were 
revealed. The first pathway was "Focal adhesion", which 
involves interaction of ECM with integrin-A and -B 
ligands. The second pathway, "ECM-receptor interac- 
tion", is related to ECM and involves collagen, laminin, 
fibronetic, tenascin and integrin-A and -B proteins. The 
third pathway is the "TGF-beta signaling". How these 
pathways, involving well-known cancer-related genes, are 
connected to TWIST1 expression in relation to disease 
progression in LNN/ER-positive cancer warrants func- 
tional validation. A common denominator among these 
three pathways, however, is their involvement in ECM 
remodeling. There is a growing body of evidence showing 
that changes in the tumor microenvironment can pro- 
mote proliferation of the tumor cells and, therefore, can 
influence outcome of patients [35], and our data connect 
this to TWIST1 expression. Furthermore, our work also 
shows that the stroma is a significant source of TWIST 
transcript expression. Indeed, the current data reinforce 
recent observations made by Roman-Perez and collea- 
gues after evaluating gene expression patterns in 72 
breast tissue samples [36]. They identified two distinct 
subtypes, Active and Inactive, in the cancer-adjacent 
extra-tumoral microenvironment of breast tissues and 
demonstrated that TWIST1 was highly expressed along 
with other stromal associated genes, such as VIM, 
ADAMTS2, COL4A2, COL4A1, ITGA7 and ITGB1, in the 
Active subtype. This could suggest a potential involve- 
ment of TWIST 1 in the activation of stroma in malignant 
breast tissue. Furthermore, the Active subtype was asso- 
ciated with expression of TGF-beta induced fibroblast 
activation signatures, and showed a strong association 
with OS among specifically ER-positive patients. Intrigu- 
ingly, also in our data set, the association of TWIST1 
with the TGF-beta signaling pathway appears to be more 
prominent in ER-positive tumors. 

Another recent study also demonstrated that TWIST1 
protein can be found in both compartments of malig- 
nant breast tissue, with a predominant expression in the 
stromal compartment [37]. More importantly, they 
found that, although rare in their hands, nuclear expres- 
sion of TWIST1 protein in the epithelial tumor cell 
compartment to be associated with poor prognosis (OS). 
TWIST1 protein expression in the stromal compartment 
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was not associated with OS, but positively associated 
with a positive ER or PGR status of the tumors. This 
mutual dependence of TWIST 1 expression on the pre- 
sence of a notable stromal component and an ER-posi- 
tive status of the tumor tissue in our and other studies 
is in line with the observed prognostic value of TWIST1 
in ER-positive disease only. Although our study is a cor- 
relative study and cannot reveal causality, it is intriguing 
and points to a novel role of TWIST1 in breast cancer. 

What is surprising about our results, which associate 
high TWIST1 mRNA expression with poor prognosis in 
ER-positive breast cancer and with ECM and stromal 
content, is the fact that up until now the process of 
EMT has been more frequently assigned to ER-negative 
cancer and was rarely observed in ER-positive breast 
cancer. Also, previously studied breast cancer cell lines 
with clear EMT-like features were ER-negative [38]. 
This left us with an apparent contradiction. We, there- 
fore, specifically explored whether markers of EMT pre- 
sent on gene expression arrays [11] were associated with 
TWIST1 expression. Of the eight markers analyzed, we 
found Zinc finger E-box-binding homeobox 1 (ZEB1), 
Zinc finger E-box-binding homeobox 2 (ZEB2), Snail2 
(SNAI2), Smooth muscle actin, also known as Alpha- 
actin-2 (ACTA2) and VIM clearly, positively correlated 
with TWIST1 expression (Rs = 0.34, 0.34, 0.35, 0.42 and 
0.28, respectively; all P-values <0.001; Additional file 3: 
Table S15). This does suggest, in line with the recent 
study of Roman-Perez and colleagues [28] that markers 
of EMT are co-expressed with TWIST1. However, the 
lack of a clear negative association of E-cadherin mRNA 
(CDH1) with TWIST 1 mRNA expression in our data set 
could indicate that EMT may not be fully established. 
Alternatively, and more in line with our findings, is that 
some markers of EMT, such as Vimentin and Smooth 
muscle actin, are also expressed in re-activated stroma 
[39]. Therefore, our findings are a starting point for 
more functional studies to be certain about TWIST1 
possible role in EMT as well as in re-activated stroma in 
ER-positive breast cancer. 

Conclusions 

In summary, we have found that the TWIST1 mRNA 
expression level is associated with small tumor size, 
invasive lobular carcinoma, stromal-rich tumors and 
with PGR and ERBB2 (over)expression. More impor- 
tantly, TWIST1 mRNA expression predicts, indepen- 
dently of the traditional prognostic factors, poor 
prognosis in patients with primary breast cancer. This 
association with poor prognosis is particularly observed 
in ER-positive disease. For this large retrospective study 
that involved 1,427 breast cancer specimens, we made 
use of all clinical material available that met the preset 
criteria described in Figure 1, thus reaching Level of 



Evidence III (LOE-III). To reach LOE-I or -II, a pro- 
spective randomized study or pooled meta-analysis is 
required. Studying TWIST1 co-expressed genes, we 
found that ECM related pathways as well as stromal 
content are correlated with TWIST1 mRNA expression 
in LNN/ER-positive breast cancer. Functional studies 
are, therefore, required to determine the causal relation 
between these pathways and the prognostic value of 
TWIST1 mRNA expression in ER-positive tumors. 

Additional material 



Abbreviations 

ACTA2: alpha-actin-2; bHLH: basic helix-loop-helix; BM2: beta-2- 
microglobulin; CI: confidence interval; DFS: disease-free survival; ECM: 
extracellular matrix; EMT: epithelial to mesenchymal transition; EpCAM: 
epithelial cell adhesion molecule; ER: estrogen receptor; FDR: false discovery 
rate; PGR: progesterone receptor; ERBB2: erythroblastic leukemia viral 
oncogene homolog 2; ESR1: estrogen receptor 1; GEO: gene expression 
omnibus; GO: gene ontology; HER2: human epidermal growth factor 
receptor 2; HMBS: hydroxymethylbilane synthase; HPRT1: hypoxanthine 
phosphoribosyltransferase 1; HR: hazard ratio; IDC: invasive ductal carcinoma; 
IHC: immunohistochemistry; ILC: invasive lobular carcinoma; KEGG: Kyoto 
Encyclopedia of Genes and Genomes; LCM: laser capture microdissection; 
LNN: lymph node-negative; LOE: Level of Evidence; MFS: metastasis-free 
survival; OS: overall survival; PEN: polyethylene naphthalate; Q: quartile; RT- 



Additional file 1: Figure SI. Log-transformed distribution of TWIST! 
mRNA expression levels in the entire cohort of patients. Figure S2 

Kaplan-Meier survival curves presenting the association of TWIST! mRNA 
expression with overall survival in: A: All 1,427 patients; B: LNN patients 
only; C: LNN and ER-positive patients; and D: LNN and ER-negative 
patients. The LNN patients in this study did not receive adjuvant 
systemic therapy. The patients are divided into four quartiles (Q1 (low) to 
Q4 (high)) based on TWIST! mRNA expression levels. TWIST1 expression 
levels are presented relative to the expression of our set of three 
reference genes (B2M, HMBS and HPRTI). Patients at risk at various time 
points are indicated. 

Additional file 2: Tables S1-S8. Table SI: Model for Uni- and 
Multivariate analysis for MFS in LNN patients (n = 778). Table S2: Model 
for Uni- and Multivariate analysis for MFS in LNN & ER-negative patients 
(n = 226). Table S3: Model for Uni- and Multivariate analysis for overall 
survival in all patients (n = 1,427). Table S4: Model for Uni- and 
Multivariate analysis for overall survival in all LNN patients (n = 778). 
Table S5: Model for Uni- and Multivariate analysis for overall survival in 
LNN & ER-positive patients (n = 552). Table S6: Model for Uni- and 
Multivariate analysis for overall survival LNN & ER-negative patients (n = 
226). Table S7: Model for Uni- and Multivariate analysis for disease-free 
survival in all patients (n = 1,427). Table S8: Model for Uni- and 
Multivariate analysis for disease-free survival in LNN & ER-positive patients 
(n = 552). 

Additional file 3: Tables S9-S15 Table S9A & B: TWIST! co -expressed 
genes measured on Affymetrix U133A gene-chips in LNN/ER-positive 
patients. Table S10: Significant pathways predicted from KEGG database 
in LNN/ER-positive patients. Table S1 1 : Gene ontology enrichment terms 
from TWIST! co-regulated genes based on their biological processes and 
cellular components. Table S12A & B: TWIST! co-expressed genes 
measured on Affymetrix U133A gene-chips in LNN/ER-negative patients. 
Table SI 3: Significant pathways predicted from KEGG database in LNN/ 
ER-negative patients. Table S14: Significant pathways predicted from 
KEGG database in all LNN patients (both ER-positive and ER-negative 
combined). Table SI 5: Correlation of TWIST! expression with known 
EMT markers measured on Affymetrix array in LNN/ER-positive samples (n 
= 221). 
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PCR: reverse transcriptase polymerase chain reaction; SP: stromal poor; SR: 
stromal rich; TWIST1: Twist-related protein 1: also known as a basic helix- 
loop-helix protein 38 (bHLHa38); VIM: vimentin; ZEB1: zinc finger e-box- 
binding homeobox 1; ZEB2: zinc finger e-box-binding homeobox 2. 
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